Fast Algorithm for Optimal Compression of Graphs

Author

  • Yongwook Choi
Abstract

We consider the problem of finding an optimal description for general unlabeled graphs. Given a probability distribution on labeled graphs, we introduced in [4] a structural entropy as a lower bound for the lossless compression of such graphs. Specifically, we proved that the structural entropy for the Erdős–Rényi random graph, in which edges are added with probability p, is $\binom{n}{2} h(p) - n \log n + O(n)$, where n is the number of vertices and h(p) = −p log p − (1 − p) log(1 − p) is the entropy rate of a conventional memoryless binary source. In this paper, we prove the asymptotic equipartition property for such graphs. Then, we propose a faster compression algorithm that asymptotically achieves the structural entropy up to the first two leading terms with high probability. Our algorithm runs in O(n + e) time on average, where e is the number of edges. To prove its asymptotic optimality, we introduce binary trees that one can classify as in-between tries and digital search trees. We use analytic techniques such as generating functions, the Mellin transform, and poissonization to establish our findings. Our experimental results confirm the theoretical results and show the usefulness of our algorithm for real-world graphs such as the Internet, biological networks, and social networks.
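The structural-entropy expression quoted in the abstract can be evaluated numerically to get a feel for how much the n log n term saves relative to the labeled-graph entropy. The following is a minimal sketch, not the paper's algorithm: it assumes base-2 logarithms (bits), drops the O(n) remainder, and the function names are hypothetical.

```python
import math


def binary_entropy(p: float) -> float:
    """h(p) = -p log p - (1 - p) log(1 - p), in bits (base-2 logs assumed)."""
    if p <= 0.0 or p >= 1.0:
        return 0.0  # limit value at the endpoints
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)


def labeled_entropy(n: int, p: float) -> float:
    """Entropy of a labeled Erdos-Renyi graph: C(n, 2) * h(p)."""
    return math.comb(n, 2) * binary_entropy(p)


def structural_entropy_leading_terms(n: int, p: float) -> float:
    """Two leading terms only: C(n, 2) * h(p) - n * log2(n).

    The O(n) remainder from the abstract is omitted, so this is an
    approximation that is only meaningful for large n.
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    return labeled_entropy(n, p) - n * math.log2(n)


if __name__ == "__main__":
    n, p = 1024, 0.5
    print(labeled_entropy(n, p))
    print(structural_entropy_leading_terms(n, p))
```

Under these assumptions, the gap between the two printed values is exactly n log2 n bits, which is the saving attributed to discarding vertex labels.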


Similar resources

Improving Fast Charging Methods Using Genetic Algorithm and Coordination between Chargers in Fast Charging Station of Electric Vehicles in Order to Optimal Utilization of Power Capacity of Station

Fast charging stations are one of the most important sections in smart grids with high penetration of electric vehicles. One of the important issues in fast chargers is choosing the proper method for charging. In this paper, by defining an optimization problem with the objective of reducing the charging time, the optimal charging levels are obtained using a multi-stage current method using a gen...


Optimal DC Fast Charging Placing And Sizing In Iran Capital (Tehran)

DC fast charging (DCFC) and optimal placing of them is a fundamental factor for the popularization of electric vehicles (EVs). This paper proposes an approach to optimize place and size of charging stations based on genetic algorithm (GA). Target of this method is minimizing cost of conversion of gas stations to charging stations. Another considered issue is minimizing EVs losses to find neares...


Optimal fast digital error correction method of pipelined analog to digital converter with DLMS algorithm

In this paper, convergence rate of digital error correction algorithm in correction of capacitor mismatch error and finite and nonlinear gain of Op-Amp has increased significantly by the use of DLMS, an evolutionary search algorithm. To this end, a 16-bit pipelined analog to digital converter was modeled. The obtained digital model is a FIR filter with 16 adjustable weights. To adjust weights o...


Efficient Lossless Compression of Trees and Graphs

In this paper, we study the problem of compressing a data structure (e.g. tree, undirected and directed graphs) in an efficient way while keeping a similar structure in the compressed form. To date, there has been no proven optimal algorithm for this problem. We use the idea of building LZW tree in LZW compression to compress a binary tree generated by a stationary ergodic source in an optimal ma...


Delta-K²-tree for Compact Representation of Web Graphs

The World Wide Web structure can be represented by a directed graph named as the web graph. The web graphs have been used in a wide range of applications. However, the increasingly large-scale web graphs pose great challenges to the traditional memory-resident graph algorithms. In the literature, K²-tree can efficiently compress the web graphs while supporting fast querying in the compressed dat...




Publication date: 2010